RAW SEQUENCE LISTING DATE: 07/22/2004

PATENT APPLICATION: US/10/807,204 TIME: 10:25:19

Input Set : A:\54720-8015.US00-SEQLIST.TXT 
Output Set: N:\CRF4\07222004\J807204.raw 

4 <110> APPLICANT: Bougueleret, Lydie 

5 Bairoch, Amos 

6 Niknejad, Anne 

8 <120> TITLE OF INVENTION: Engineered Human Kunitz-Type Protease 

9 Inhibitor 
11 <130> FILE REFERENCE: 54720-8015.US00

13 <140> CURRENT APPLICATION NUMBER: US 10/807,204 

14 <141> CURRENT FILING DATE: 2004-03-22 

16 <150> PRIOR APPLICATION NUMBER: PCT/EP03/01629

17 <151> PRIOR FILING DATE: 2003-02-18 

19 <150> PRIOR APPLICATION NUMBER: US 60/358,683 

20 <151> PRIOR FILING DATE: 2002-02-21 
22 <160> NUMBER OF SEQ ID NOS : 16 

24 <170> SOFTWARE: FastSEQ for Windows Version 4.0 

26 <210> SEQ ID NO: 1 

27 <211> LENGTH: 131 

28 <212> TYPE: PRT
29 <213> ORGANISM: Homo sapiens 

31 <220> FEATURE: 

32 <221> NAME/KEY: VARIANT 

33 <222> LOCATION: (1)...(131) 

34 <223> OTHER INFORMATION: eppin-like precursor

36 <221> NAME/KEY: SIGNAL

37 <222> LOCATION: (1)...(25)

38 <223> OTHER INFORMATION: predicted by SignalP 2.0 
W--> 40 <221> PEPTIDE 

41 <222> LOCATION: (26)... (131) 

42 <223> OTHER INFORMATION: mature peptide 
W--> 44 <221> DOMAIN 

45 <222> LOCATION: (77)... (127) 

46 <223> OTHER INFORMATION: Kunitz domain predicted by pfscan 
W--> 48 <221> DISULFID 

49 <222> LOCATION: (33)... (61) 

50 <223> OTHER INFORMATION: predicted disulfide bond 
W--> 52 <221> DISULFID 

53 <222> LOCATION: (40)... (65) 

54 <223> OTHER INFORMATION: predicted disulfide bond 
W--> 56 <221> DISULFID 

57 <222> LOCATION: (48)... (60) 

58 <223> OTHER INFORMATION: predicted disulfide bond 
W--> 60 <221> DISULFID 

61 <222> LOCATION: (54)... (69) 

62 <223> OTHER INFORMATION: predicted disulfide bond 
W--> 64 <221> DISULFID

65 <222> LOCATION: (77)... (127) 

66 <223> OTHER INFORMATION: predicted disulfide bond 
W--> 68 <221> DISULFID 

69 <222> LOCATION: (86)... (110) 

70 <223> OTHER INFORMATION: predicted disulfide bond 
W--> 72 <221> DISULFID 

73 <222> LOCATION: (102)...(123)

74 <223> OTHER INFORMATION: predicted disulfide bond 
<223> OTHER INFORMATION: mature form
<223> OTHER INFORMATION: Kunitz domain predicted by pfscan

W--> 110 <221> DISULFID 

111 <222> LOCATION: (8)...(36)

112 <223> OTHER INFORMATION: predicted disulfide bond 
W--> 114 <221> DISULFID

115 <222> LOCATION: (15)... (40) 

116 <223> OTHER INFORMATION: predicted disulfide bond 
W--> 118 <221> DISULFID 

119 <222> LOCATION: (23)... (35) 

120 <223> OTHER INFORMATION: predicted disulfide bond 
W--> 122 <221> DISULFID 
123 <222> LOCATION: (29)...(44)

124 <223> OTHER INFORMATION: predicted disulfide bond
W--> 126 <221> DISULFID 

127 <222> LOCATION: (52)... (102) 

128 <223> OTHER INFORMATION: predicted disulfide bond 
W--> 130 <221> DISULFID 

131 <222> LOCATION: (61)... (85) 

132 <223> OTHER INFORMATION: predicted disulfide bond 
W--> 134 <221> DISULFID 

135 <222> LOCATION: (77)... (98) 

136 <223> OTHER INFORMATION: predicted disulfide bond 
W--> 138 <400> 2 
154 <210> SEQ ID NO: 3

155 <211> LENGTH: 11201 

156 <212> TYPE: DNA 

157 <213> ORGANISM: Homo sapiens 

159 <220> FEATURE: 

160 <221> NAME/KEY: misc_feature

161 <222> LOCATION: (1) ... (11201) 

162 <223> OTHER INFORMATION: public sequence AL031663.2 (94000...105200)

164 <221> NAME/KEY: CDS

165 <222> LOCATION: (2043)...(2132)
W--> 167 <221> CDS 

168 <222> LOCATION: (3358) ... (3486) 

W--> 170 <221> CDS 

171 <222> LOCATION: (6120)...(6191)

W--> 173 <221> CDS

174 <222> LOCATION: (6193)...(6291)

W--> 176 <221> misc_feature

177 <222> LOCATION: (6291)...(6293)

178 <223> OTHER INFORMATION: potential Stop codon 
W--> 180 <221> 3'UTR

181 <222> LOCATION: (6294)...(7236)

182 <223> OTHER INFORMATION: partial
W--> 184 <221> polyA_signal
185 <222> LOCATION: (7231)...(7236)

187 <400> 3 

188 tttaaagaat catgtaaaac aggcttacaa attttctttt taaaaatatt ttattttatt 60 

189 ttacgttccg ggatacatgt gcaggacctg caggtttgtt acataggtaa gcctgtgcca 120 

190 tggtggtttg ctgcacctat caacccatta cctgggtatt aagccccgca tgcattagct 180 

191 atttatcctg atgctttccc tccccctgcc ccctcaacag tccacgtgtt ctcattgttc 240 

192 atctcccact tgcaagtgag aacatgtggt atttactttt gttcctgcgt tactttgctg 300 

193 aggataacgg cttccagctt catccatgtc cctgcaaagg acgtgatctc atttcttttt 360 

194 atggctgcat agtattccat ggcatatatg taccacattt tctttatcca gcctatcgtt 420 

195 gatggatatt tggtttgatt ccatgttttt gctattgtaa atagtgctgc agtgaacata 480 

196 cacatgcatg tatctttata aaagaatgat ttatattcct ttgagtgtat acccagtaat 540 

197 gggattgctg ggtcaaatgg tatttctggt tctagatctt tcaggaattg gcacactctc 600 

198 ttccacaatg gttgaaccaa tttacattcc caacagtgta aaagcattca tgtttctcca 660 

199 cagctttacc agcatctgtt gtttctggac tttttaataa tcaccattct gactggcatg 720 

200 agatggtatc tcattgtggt tttgatttgc atttctctga tgatcactgg tgttgagctt 780 

201 tttttcatat gtttgttggc cgcataaatg tcttcttttg agaagtgtct gttcatgtcc 840 

202 tttgcccact tttcaatggg gttgtttgtt ttttttcttg taaatttgtt taagttcctt 900 

203 gtagattctg gatatcagac cacaaatttt cttaatctgt ctccagtgat gggaaagcta 960 

204 agctatttgt ccactgattc ccaacgccca ctggatgcca gggatactaa atccctcgct 1020 

205 ctttcaaact gttatacttg caagctatac aagctccctg ggggccagag aaaacccaca 1080 

206 gcagaaaatt agggactgat gctagaggta ggaggctgtc aatgtgcatt ggctattatt 1140 

207 cggccataaa aaggaatgaa cttctcactc accactgatg acatgccttg ctgcacacac 1200

208 accctgctgc caccctgcag aaatgcttcc attacccaca gtcctttgcc agatggaact 1260

209 gatgcccagg taactggctc ctcacctcct tcgggcttat accaaggata taaaacttgg 1320 

210 tacagatgct caagcgttaa tgcttcaggg tgtgactttt aatccaatgc tgtagctgtg 1380 

211 acaacaggga aaaacggaga acagagttgg agaagtccca aagtaatgaa agaagttatt 1440

212 tactgttgca aaatcaaacg acttaaagga taaatgtaac tgatctttac atacgaaaga 1500 

213 agagagaggg aagtgaaggg cggggagaaa gagactcaga cagacagaga ctccctgaga 1560 

214 aaaagacagg cacagagtaa aagacataca gatgcacaga aaagagaaga aacagagtga 1620

215 gaaaaataaa acataagaat agacatgtca ggagaagagt gaaattagaa agaaggaaga 1680 

216 gagctagcat actgttaacc tgcaaactcc ccctgggaaa tggaagactc tggagttaaa 1740 

217 cctccccttc tgctaaggga gcctggaaag cccaagacaa catgtgccct ctttccctgg 1800 

218 tagaccagaa agcctcagat tgctgaggat taggagccac ctatccccca gtgtaggtac 1860 

219 caggtagtcc cactgatggc ctcagggatc tggtgttcct gatcggggag tctggctggc 1920
220 ttgtctccag ggctgtctct taaggtgctg ggccacaccg gggcaggcaa agtgcagatg 1980

221 taggagctgc tgaggagcag ttcttagttt tggtgccatc aaccaggcca gtacttccta 2040 

222 aa atg gga ctc tca gga ctt ctg cca atc ctg gta cca ttc atc ctt 2087

223 Met Gly Leu Ser Gly Leu Leu Pro Ile Leu Val Pro Phe Ile Leu

224 1 5 10 15

226 ttg ggg gac atc cag gaa cct ggg cac gct gaa ggc atc ctt ggc 2132

227 Leu Gly Asp Ile Gln Glu Pro Gly His Ala Glu Gly Ile Leu Gly

228 20 25 30 

230 agtaagtatt gggaactccg tccttgcaat ggggaaagga aactcagtgg ggaggaagga 2192

231 acaaagaata attttgcagc tgagaaatta ttttctccac ctctgcctgg aattccaggt 2252

232 gggaaagagg ggagctattt gaagtctctt ggtaagagac tcccttggtg aatacaggca 2312 

233 tccaaatctg tggcaagtga ctccttgctg ccattccaaa gagttcttcc tggaggtgcc 2372 

234 tggacctaga gctcatgaaa tgaactaaga aggagtagta agcccaagga agtctcccac 2432

235 aagaaaaaca ggctgctccc cacaacacca cctgtgtcaa cttccaaatg ttctcattgc 2492 

236 agcccccaca cccataccca cagaggaccc tggtctttgg gttaaagaat ggtctccaga 2552 
237 agtcccttgg aatgctgaaa tgtgaagact cacaattagc agctaactgt actgggcact 2612

238 gaaaattagt ggttgaaagt gagcactcag gagccacaca ttgtaggttc aaacctggct 2672 

239 ccacccctga gtagctgtgt gatcttgggc aaattcacat aatctcctta tgcttccaat 2732 

240 tcttctgttt acaaaatgtg aatattaaga gtacttaccc atagatttgt aatgaggatt 2792 

241 aaggagaaaa ggcagctcac agtaagtgct tcatccatat tactcattgt tacgttacat 2852 

242 tgccccaggc agtttgacga tggcttgtca aactgtgttg ctggttagtc ttcaaagcaa 2912 

243 ccctatgggc aaatgaggaa atcaaggctc agagagagtg cctggcacac caagcgtctc 2972 

244 acagaattca aaagcccagc tatctgatgc cagaaatgtt aatctcaatc atcgagagcc 3032 

245 acagcacctc ctgggggagc aaccaagcac agcaaggctg agtgaccaga acagtcttag 3092 

246 aggaatgcgt ctttctctca ggaatctcat agatggtgcg ggtagggggt aggggttggt 3152 

247 ttaggcatcc gatggtgagt tgttatgggg ttggggatgt gagcagtgtt gtgaaattcc 3212 

248 tgagccccct gagggccact accttctggg gataggtgaa gtggaaacaa agttctctaa 3272 

249 ctctggaggg gtgccctggt ctcggggagg gggagcactg gctcctttaa ggccttgagg 3332

250 gaatatatct tcccctccct tagag ccg tgt ccc aaa atc aaa gtg gaa tgc 3384

251 Pro Cys Pro Lys Ile Lys Val Glu Cys

252 35 

254 gaa gtg gaa gaa ata gac cag tgt acc aaa ccc aga gat tgc cca gaa 3432 

255 Glu Val Glu Glu Ile Asp Gln Cys Thr Lys Pro Arg Asp Cys Pro Glu

256 40 45 50 55 

258 aac atg aag tgt tgc ccg ttc agc cgt gga aag aaa tgt tta gac ttc 3480
259 Asn Met Lys Cys Cys Pro Phe Ser Arg Gly Lys Lys Cys Leu Asp Phe
260 60 65 70 

262 aga aag gtaactcaga tgcttcctaa attacccagt gccctcacct cctatctcca 3536 

263 Arg Lys 

266 cctgcactgc actacctctg tctcctagtt cactgatggc tggtctctga ccaagagttt 3596 

267 aggtacatct tctacttgac acaaatctgc cagtcctgga cttgggagac atggatttta 3656 

268 tttcatttct tcctctattt tggtaagtga tgtgaacaat gtagttttcc tttctgggcc 3716 

269 tcagtttccc tttccataag acaggtgtgg ataatggacc acatggtctc ttggctactc 3776 
270 ttcagtgatt ctcgagatgc caagtagtca ccaaagtctc atcaaatgcc aacattgtgt 3836

271 caggttatat gatagctata agagacaggt atataatctg tatttcctgg cttttttctc 3896 

272 taggcctctc tgtctacata aatccaggat gccttccctg accaccttag accgcctaca 3956 

273 ctctttcttc tctgcccata ttattgagct ctaaagaaga tcatcaagga ccattcgttc 4016 

274 attctattca ttcaacaaat attgattggt gcccctgaca cccatgatgg gttccacttg 4076 

275 ctcattgtgg ccaggaaggt cagaaaagtt tttctgggaa agagatccat gatttaagat 4136 
276 ctgaaagatg agtacttaga ccaaaaatgg gaagaaatag cctttcaggc cagataatag 4196

277 gatgtgcaaa ggctctgtgg caggaggcag gatgttcagg tctagaaagg ctagaaagca 4256 

278 gagagtgagc aagagacaag aaacaggaaa ctggtagtgt atattcaggg caacttgaat 4316 
279 ctgtattccc aagtggccat cctcaagttt tggggtcaaa taaatgccct acttaatgat 4376

280 attttctgaa tcttgttact ttaggttgac atacctttca agacaataac taggttttca 4436 

281 gtgctttgat aaaacctcaa atgtgtggcc ttccagccag gcatctgtgg tttcatattg 4496

282 tttaagcatt tgttgcatcc ctatggattg cagctctcta taggttcgct gttgctgaac 4556

283 caagaaataa ggcatacaca gatttctaga gaaattgtaa ttctataaca gattttcagg 4616 

284 agatgaggaa ggctggtatc tttcagagta caaagtgatt ttggaattga aagaatttct 4676 

285 ttgggttaaa gtttgtcact gacttgtgtt cctgaaccat gaagcatgaa tatgtgggct 4736 

286 gagaaatagt ttctcttgat aaataaacaa gtaacaaatt ttaaaaagag aggaagatga 4796 

287 agctgggcta gttgggggaa ggagagctgg taagggatct ggtcttgatc ctaacagcaa 4856 

288 tggattaacc tatgttcaaa tcccttacca tctgtgtggc ctcaagaaag ttatttagcc 4916

289 tctctaaacc ttagttttct catctgcata atgagaacaa tgagaggacc tactttacaa 4976
290 gattaaatgt taatagatgt caactactaa gaacactaat caatgctata attgttgttt 5036
L:40 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:1

L:44 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:1 

L:48 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:1 

L:52 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:1 

L:56 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:1

L:60 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:1

L:64 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:1

L:68 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:1

L:72 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:1

L:76 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:1 